Overview
Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 3110 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 464 |
| Duplicate rows (%) | 14.9% |
| Total size in memory | 340.2 KiB |
| Average record size in memory | 112.0 B |
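The derived figures in this table follow from the raw counts. The sketch below re-derives them in Python, assuming that the duplicate percentage is taken over observations and that 1 KiB equals 1024 bytes; neither convention is stated in the report itself.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 13
n_observations = 3110
missing_cells = 0
duplicate_rows = 464
total_memory_kib = 340.2

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
# Assumption: the percentage is relative to the number of observations.
duplicate_pct = 100 * duplicate_rows / n_observations
# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_memory_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the computed values agree with the table to the displayed precision.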
Variable types
| Categorical | 1 |
|---|---|
| Numeric | 12 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 464 (14.9%) duplicate rows | Duplicates |
| alcohol is highly overall correlated with density | High correlation |
| chlorides is highly overall correlated with density and 4 other fields | High correlation |
| density is highly overall correlated with alcohol and 3 other fields | High correlation |
| fixed acidity is highly overall correlated with chlorides and 2 other fields | High correlation |
| free sulfur dioxide is highly overall correlated with total sulfur dioxide | High correlation |
| residual sugar is highly overall correlated with type | High correlation |
| sulphates is highly overall correlated with type | High correlation |
| total sulfur dioxide is highly overall correlated with chlorides and 2 other fields | High correlation |
| type is highly overall correlated with chlorides and 6 other fields | High correlation |
| volatile acidity is highly overall correlated with chlorides and 1 other field | High correlation |
| citric acid has 138 (4.4%) zeros | Zeros |
Reproduction
| Analysis started | 2024-10-31 21:23:11.384689 |
|---|---|
| Analysis finished | 2024-10-31 21:23:24.540744 |
| Duration | 13.16 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
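The reported duration can be checked from the two timestamps. A small sketch using only the standard library; the format string is an assumption based on how the timestamps are printed in the table.

```python
from datetime import datetime

# Assumed timestamp layout, matching the values shown in the table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-10-31 21:23:11.384689", FMT)
finished = datetime.strptime("2024-10-31 21:23:24.540744", FMT)

duration_s = (finished - started).total_seconds()
print(f"Duration: {duration_s:.2f} seconds")
```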
Variables
type
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 48.6 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 6.5318328 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Moscatel |
|---|---|
| 2nd row | Moscatel |
| 3rd row | Moscatel |
| 4th row | Moscatel |
| 5th row | Moscatel |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Moscatel | 1588 | 51.1% |
| Syrah | 1522 | 48.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 3110 | 15.3% |
| M | 1588 | 7.8% |
| s | 1588 | 7.8% |
| o | 1588 | 7.8% |
| c | 1588 | 7.8% |
| t | 1588 | 7.8% |
| e | 1588 | 7.8% |
| l | 1588 | 7.8% |
| S | 1522 | 7.5% |
| y | 1522 | 7.5% |
| Other values (2) | 3044 | 15.0% |
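The character counts above, and the mean length reported for `type`, can be reproduced from the two category counts alone. The following is a sketch of that arithmetic, not the profiler's own implementation.

```python
from collections import Counter

# Category counts copied from the "Common Values" table for `type`.
category_counts = {"Moscatel": 1588, "Syrah": 1522}

# Each character of a label occurs once per row carrying that label.
char_counts = Counter()
for label, n_rows in category_counts.items():
    for ch in label:
        char_counts[ch] += n_rows

total_chars = sum(char_counts.values())
total_rows = sum(category_counts.values())
mean_length = total_chars / total_rows

print(char_counts.most_common(3))
print(total_chars, round(mean_length, 7))
```

The letter `a` appears in both labels, which is why its count equals the number of observations.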
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 20314 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 20314 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 20314 |
fixed acidity
Real number (ℝ)
High correlation 
| Distinct | 90 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.3284566 |
| Minimum | 3.8 |
| Maximum | 15.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.6 KiB |
Quantile statistics
| Minimum | 3.8 |
|---|---|
| 5-th percentile | 5.6 |
| Q1 | 6.4 |
| median | 7 |
| Q3 | 7.9 |
| 95-th percentile | 10.4 |
| Maximum | 15.9 |
| Range | 12.1 |
| Interquartile range (IQR) | 1.5 |
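Range and IQR are simple differences of the quantiles listed above. A minimal check in Python, with values copied from the table:

```python
# Quantiles copied from the fixed acidity "Quantile statistics" table.
minimum, q1, median, q3, maximum = 3.8, 6.4, 7.0, 7.9, 15.9

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)

# Round to one decimal to hide binary floating-point noise.
print("Range:", round(value_range, 1))
print("IQR:", round(iqr, 1))
```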
Descriptive statistics
| Standard deviation | 1.4523095 |
|---|---|
| Coefficient of variation (CV) | 0.19817399 |
| Kurtosis | 1.9929427 |
| Mean | 7.3284566 |
| Median Absolute Deviation (MAD) | 0.7 |
| Skewness | 1.2753696 |
| Sum | 22791.5 |
| Variance | 2.1092028 |
| Monotonicity | Not monotonic |
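Several of the descriptive statistics above are linked by simple identities: the mean is the sum divided by the number of observations, the variance is the squared standard deviation, and the coefficient of variation is the standard deviation divided by the mean. The sketch below checks these using the rounded figures from the table, so agreement is only expected to about six decimal places.

```python
import math

n = 3110          # Number of observations (from "Dataset statistics")
total = 22791.5   # Sum
std = 1.4523095   # Standard deviation

mean = total / n
variance = std ** 2
cv = std / mean

print(f"mean={mean:.7f} variance={variance:.7f} cv={cv:.8f}")

# Compare against the reported values, allowing for rounding of the inputs.
assert math.isclose(mean, 7.3284566, abs_tol=1e-6)
assert math.isclose(variance, 2.1092028, abs_tol=1e-6)
assert math.isclose(cv, 0.19817399, abs_tol=1e-6)
```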
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 6.6 | 155 | 5.0% |
| 6.8 | 149 | 4.8% |
| 6.4 | 143 | 4.6% |
| 6.7 | 119 | 3.8% |
| 6 | 114 | 3.7% |
| 7.2 | 114 | 3.7% |
| 7 | 114 | 3.7% |
| 7.1 | 111 | 3.6% |
| 6.9 | 103 | 3.3% |
| 6.5 | 100 | 3.2% |
| Other values (80) | 1888 | 60.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 3.8 | 1 | < 0.1% |
| 3.9 | 1 | < 0.1% |
| 4.4 | 3 | 0.1% |
| 4.6 | 1 | < 0.1% |
| 4.7 | 6 | 0.2% |
| 4.8 | 7 | 0.2% |
| 4.9 | 5 | 0.2% |
| 5 | 19 | 0.6% |
| 5.1 | 14 | 0.5% |
| 5.2 | 16 | 0.5% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 15.9 | 1 | < 0.1% |
| 13.8 | 1 | < 0.1% |
| 13.3 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 12.8 | 3 | 0.1% |
| 12.7 | 4 | 0.1% |
| 12.6 | 3 | 0.1% |
| 12.5 | 6 | 0.2% |
| 12.4 | 3 | 0.1% |
| 12.3 | 1 | < 0.1% |
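The Frequency (%) column in the tables above appears to be the count divided by the 3110 observations, rounded to one decimal, with `< 0.1%` shown when a non-zero share rounds down to zero. The helper below is a hypothetical reconstruction of that display rule inferred from the printed values; it is not taken from the profiling tool's source.

```python
def format_frequency(count: int, total: int) -> str:
    """Format count/total as a one-decimal percentage.

    Assumed display rule (inferred from the tables): a non-zero share
    that rounds to 0.0 is shown as "< 0.1%".
    """
    rounded = round(100 * count / total, 1)
    if count > 0 and rounded == 0.0:
        return "< 0.1%"
    return f"{rounded:.1f}%"


N_OBSERVATIONS = 3110
for count in (1, 3, 6, 155):
    print(count, format_frequency(count, N_OBSERVATIONS))
```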
volatile acidity
Real number (ℝ)
High correlation 
| Distinct | 171 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.40281833 |
| Minimum | 0.085 |
| Maximum | 1.58 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.6 KiB |
Quantile statistics
| Minimum | 0.085 |
|---|---|
| 5-th percentile | 0.17 |
| Q1 | 0.26 |
| median | 0.35 |
| Q3 | 0.53 |
| 95-th percentile | 0.74775 |
| Maximum | 1.58 |
| Range | 1.495 |
| Interquartile range (IQR) | 0.27 |
Descriptive statistics
| Standard deviation | 0.18984357 |
|---|---|
| Coefficient of variation (CV) | 0.47128832 |
| Kurtosis | 1.1248979 |
| Mean | 0.40281833 |
| Median Absolute Deviation (MAD) | 0.12 |
| Skewness | 1.0170516 |
| Sum | 1252.765 |
| Variance | 0.036040583 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.28 | 118 | 3.8% |
| 0.24 | 102 | 3.3% |
| 0.22 | 97 | 3.1% |
| 0.27 | 88 | 2.8% |
| 0.26 | 87 | 2.8% |
| 0.32 | 86 | 2.8% |
| 0.3 | 85 | 2.7% |
| 0.31 | 84 | 2.7% |
| 0.36 | 79 | 2.5% |
| 0.29 | 70 | 2.3% |
| Other values (161) | 2214 | 71.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.085 | 1 | < 0.1% |
| 0.09 | 1 | < 0.1% |
| 0.105 | 4 | 0.1% |
| 0.11 | 5 | 0.2% |
| 0.12 | 10 | 0.3% |
| 0.13 | 8 | 0.3% |
| 0.14 | 15 | 0.5% |
| 0.145 | 2 | 0.1% |
| 0.15 | 31 | 1.0% |
| 0.16 | 48 | 1.5% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.58 | 1 | < 0.1% |
| 1.33 | 2 | 0.1% |
| 1.24 | 1 | < 0.1% |
| 1.185 | 1 | < 0.1% |
| 1.18 | 1 | < 0.1% |
| 1.13 | 1 | < 0.1% |
| 1.115 | 1 | < 0.1% |
| 1.1 | 1 | < 0.1% |
| 1.09 | 1 | < 0.1% |
| 1.07 | 1 | < 0.1% |
citric acid
Real number (ℝ)
Zeros 
| Distinct | 83 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.28375884 |
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 138 |
| Zeros (%) | 4.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.01 |
| Q1 | 0.2 |
| median | 0.28 |
| Q3 | 0.36 |
| 95-th percentile | 0.54 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.16 |
Descriptive statistics
| Standard deviation | 0.15469282 |
|---|---|
| Coefficient of variation (CV) | 0.54515594 |
| Kurtosis | 0.51139828 |
| Mean | 0.28375884 |
| Median Absolute Deviation (MAD) | 0.08 |
| Skewness | 0.30860124 |
| Sum | 882.49 |
| Variance | 0.023929868 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.28 | 152 | 4.9% |
| 0.3 | 143 | 4.6% |
| 0 | 138 | 4.4% |
| 0.26 | 130 | 4.2% |
| 0.32 | 126 | 4.1% |
| 0.27 | 123 | 4.0% |
| 0.24 | 114 | 3.7% |
| 0.29 | 109 | 3.5% |
| 0.25 | 88 | 2.8% |
| 0.33 | 87 | 2.8% |
| Other values (73) | 1900 | 61.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 138 | 4.4% |
| 0.01 | 37 | 1.2% |
| 0.02 | 52 | 1.7% |
| 0.03 | 30 | 1.0% |
| 0.04 | 32 | 1.0% |
| 0.05 | 21 | 0.7% |
| 0.06 | 26 | 0.8% |
| 0.07 | 20 | 0.6% |
| 0.08 | 33 | 1.1% |
| 0.09 | 37 | 1.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2 | 0.1% |
| 0.91 | 2 | 0.1% |
| 0.86 | 1 | < 0.1% |
| 0.82 | 1 | < 0.1% |
| 0.79 | 1 | < 0.1% |
| 0.78 | 2 | 0.1% |
| 0.76 | 1 | < 0.1% |
| 0.75 | 1 | < 0.1% |
| 0.74 | 4 | 0.1% |
| 0.73 | 4 | 0.1% |
residual sugar
Real number (ℝ)
High correlation 
| Distinct | 227 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.4713826 |
| Minimum | 0.7 |
| Maximum | 22.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.6 KiB |
Quantile statistics
| Minimum | 0.7 |
|---|---|
| 5-th percentile | 1.2 |
| Q1 | 1.9 |
| median | 2.4 |
| Q3 | 6.075 |
| 95-th percentile | 13.7 |
| Maximum | 22.6 |
| Range | 21.9 |
| Interquartile range (IQR) | 4.175 |
Descriptive statistics
| Standard deviation | 4.0701145 |
|---|---|
| Coefficient of variation (CV) | 0.91025861 |
| Kurtosis | 1.7679741 |
| Mean | 4.4713826 |
| Median Absolute Deviation (MAD) | 0.8 |
| Skewness | 1.616036 |
| Sum | 13906 |
| Variance | 16.565832 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 187 | 6.0% |
| 1.8 | 154 | 5.0% |
| 2.1 | 140 | 4.5% |
| 1.9 | 138 | 4.4% |
| 2.2 | 137 | 4.4% |
| 2.3 | 120 | 3.9% |
| 2.4 | 104 | 3.3% |
| 2.5 | 100 | 3.2% |
| 1.6 | 96 | 3.1% |
| 1.7 | 94 | 3.0% |
| Other values (217) | 1840 | 59.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.7 | 2 | 0.1% |
| 0.8 | 5 | 0.2% |
| 0.9 | 16 | 0.5% |
| 1 | 29 | 0.9% |
| 1.1 | 52 | 1.7% |
| 1.15 | 1 | < 0.1% |
| 1.2 | 74 | 2.4% |
| 1.3 | 55 | 1.8% |
| 1.4 | 84 | 2.7% |
| 1.45 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 22.6 | 1 | < 0.1% |
| 20.3 | 1 | < 0.1% |
| 19.95 | 1 | < 0.1% |
| 19.4 | 1 | < 0.1% |
| 19.3 | 3 | 0.1% |
| 19.25 | 2 | 0.1% |
| 18.75 | 1 | < 0.1% |
| 18.5 | 1 | < 0.1% |
| 18.4 | 1 | < 0.1% |
| 18.35 | 1 | < 0.1% |
chlorides
Real number (ℝ)
High correlation 
| Distinct | 191 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.065720579 |
| Minimum | 0.009 |
| Maximum | 0.611 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.6 KiB |
Quantile statistics
| Minimum | 0.009 |
|---|---|
| 5-th percentile | 0.03 |
| Q1 | 0.042 |
| median | 0.057 |
| Q3 | 0.08 |
| 95-th percentile | 0.11355 |
| Maximum | 0.611 |
| Range | 0.602 |
| Interquartile range (IQR) | 0.038 |
Descriptive statistics
| Standard deviation | 0.042082407 |
|---|---|
| Coefficient of variation (CV) | 0.64032313 |
| Kurtosis | 42.888796 |
| Mean | 0.065720579 |
| Median Absolute Deviation (MAD) | 0.019 |
| Skewness | 5.130326 |
| Sum | 204.391 |
| Variance | 0.001770929 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.036 | 76 | 2.4% |
| 0.048 | 75 | 2.4% |
| 0.044 | 74 | 2.4% |
| 0.05 | 69 | 2.2% |
| 0.08 | 65 | 2.1% |
| 0.047 | 63 | 2.0% |
| 0.042 | 61 | 2.0% |
| 0.041 | 56 | 1.8% |
| 0.076 | 55 | 1.8% |
| 0.04 | 54 | 1.7% |
| Other values (181) | 2462 | 79.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.009 | 1 | < 0.1% |
| 0.012 | 2 | 0.1% |
| 0.013 | 1 | < 0.1% |
| 0.014 | 2 | 0.1% |
| 0.015 | 4 | 0.1% |
| 0.016 | 1 | < 0.1% |
| 0.017 | 3 | 0.1% |
| 0.018 | 4 | 0.1% |
| 0.019 | 1 | < 0.1% |
| 0.02 | 6 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.611 | 1 | < 0.1% |
| 0.61 | 1 | < 0.1% |
| 0.467 | 1 | < 0.1% |
| 0.464 | 1 | < 0.1% |
| 0.422 | 1 | < 0.1% |
| 0.415 | 3 | 0.1% |
| 0.414 | 2 | 0.1% |
| 0.413 | 1 | < 0.1% |
| 0.403 | 1 | < 0.1% |
| 0.401 | 1 | < 0.1% |
free sulfur dioxide
Real number (ℝ)
High correlation 
| Distinct | 99 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.665916 |
| Minimum | 1 |
| Maximum | 289 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 12 |
| median | 23 |
| Q3 | 35 |
| 95-th percentile | 57 |
| Maximum | 289 |
| Range | 288 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 17.412649 |
|---|---|
| Coefficient of variation (CV) | 0.67843473 |
| Kurtosis | 17.894867 |
| Mean | 25.665916 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 2.056841 |
| Sum | 79821 |
| Variance | 303.20035 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 138 | 4.4% |
| 5 | 104 | 3.3% |
| 15 | 97 | 3.1% |
| 17 | 87 | 2.8% |
| 12 | 86 | 2.8% |
| 10 | 83 | 2.7% |
| 16 | 83 | 2.7% |
| 26 | 80 | 2.6% |
| 29 | 78 | 2.5% |
| 21 | 78 | 2.5% |
| Other values (89) | 2196 | 70.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 3 | 0.1% |
| 2 | 2 | 0.1% |
| 3 | 51 | 1.6% |
| 4 | 42 | 1.4% |
| 5 | 104 | 3.3% |
| 5.5 | 1 | < 0.1% |
| 6 | 138 | 4.4% |
| 7 | 75 | 2.4% |
| 8 | 63 | 2.0% |
| 9 | 66 | 2.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 289 | 1 | < 0.1% |
| 124 | 1 | < 0.1% |
| 112 | 1 | < 0.1% |
| 108 | 3 | 0.1% |
| 105 | 2 | 0.1% |
| 101 | 2 | 0.1% |
| 98 | 3 | 0.1% |
| 97 | 1 | < 0.1% |
| 87 | 2 | 0.1% |
| 81 | 3 | 0.1% |
total sulfur dioxide
Real number (ℝ)
High correlation 
| Distinct | 227 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 88.464148 |
| Minimum | 6 |
| Maximum | 440 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.6 KiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 14 |
| Q1 | 38 |
| median | 90 |
| Q3 | 127 |
| 95-th percentile | 183 |
| Maximum | 440 |
| Range | 434 |
| Interquartile range (IQR) | 89 |
Descriptive statistics
| Standard deviation | 54.618018 |
|---|---|
| Coefficient of variation (CV) | 0.61740287 |
| Kurtosis | -0.28277211 |
| Mean | 88.464148 |
| Median Absolute Deviation (MAD) | 45 |
| Skewness | 0.39513864 |
| Sum | 275123.5 |
| Variance | 2983.1279 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 28 | 41 | 1.3% |
| 111 | 39 | 1.3% |
| 113 | 38 | 1.2% |
| 15 | 34 | 1.1% |
| 31 | 33 | 1.1% |
| 24 | 33 | 1.1% |
| 18 | 33 | 1.1% |
| 20 | 33 | 1.1% |
| 122 | 32 | 1.0% |
| 23 | 30 | 1.0% |
| Other values (217) | 2764 | 88.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 3 | 0.1% |
| 7 | 4 | 0.1% |
| 8 | 14 | 0.5% |
| 9 | 15 | 0.5% |
| 10 | 28 | 0.9% |
| 11 | 26 | 0.8% |
| 12 | 29 | 0.9% |
| 13 | 28 | 0.9% |
| 14 | 30 | 1.0% |
| 15 | 34 | 1.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 440 | 1 | < 0.1% |
| 289 | 1 | < 0.1% |
| 278 | 1 | < 0.1% |
| 259 | 1 | < 0.1% |
| 251 | 1 | < 0.1% |
| 248 | 2 | 0.1% |
| 243 | 1 | < 0.1% |
| 240 | 1 | < 0.1% |
| 230 | 1 | < 0.1% |
| 227 | 1 | < 0.1% |
density
Real number (ℝ)
High correlation 
| Distinct | 833 |
|---|---|
| Distinct (%) | 26.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.99485181 |
| Minimum | 0.98711 |
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.6 KiB |
Quantile statistics
| Minimum | 0.98711 |
|---|---|
| 5-th percentile | 0.9896145 |
| Q1 | 0.9925 |
| median | 0.99546 |
| Q3 | 0.9971375 |
| 95-th percentile | 0.9989 |
| Maximum | 1 |
| Range | 0.01289 |
| Interquartile range (IQR) | 0.0046375 |
Descriptive statistics
| Standard deviation | 0.0028979734 |
|---|---|
| Coefficient of variation (CV) | 0.0029129699 |
| Kurtosis | -0.81420762 |
| Mean | 0.99485181 |
| Median Absolute Deviation (MAD) | 0.00206 |
| Skewness | -0.42680203 |
| Sum | 3093.9891 |
| Variance | 8.3982496 × 10⁻⁶ |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.9976 | 37 | 1.2% |
| 0.9972 | 37 | 1.2% |
| 0.9968 | 37 | 1.2% |
| 0.9984 | 35 | 1.1% |
| 0.998 | 32 | 1.0% |
| 0.9964 | 29 | 0.9% |
| 0.9962 | 28 | 0.9% |
| 0.9978 | 27 | 0.9% |
| 0.997 | 27 | 0.9% |
| 0.9974 | 25 | 0.8% |
| Other values (823) | 2796 | 89.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.98711 | 1 | < 0.1% |
| 0.98722 | 1 | < 0.1% |
| 0.9874 | 1 | < 0.1% |
| 0.98742 | 2 | 0.1% |
| 0.98746 | 2 | 0.1% |
| 0.98758 | 1 | < 0.1% |
| 0.98774 | 1 | < 0.1% |
| 0.98779 | 1 | < 0.1% |
| 0.98794 | 2 | 0.1% |
| 0.98816 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 10 | 0.3% |
| 0.9999 | 1 | < 0.1% |
| 0.9998 | 10 | 0.3% |
| 0.99976 | 1 | < 0.1% |
| 0.99975 | 1 | < 0.1% |
| 0.99974 | 1 | < 0.1% |
| 0.99971 | 2 | 0.1% |
| 0.9997 | 8 | 0.3% |
| 0.99966 | 1 | < 0.1% |
| 0.99965 | 1 | < 0.1% |
pH
Real number (ℝ)
| Distinct | 98 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2380161 |
| Minimum | 2.74 |
| Maximum | 4.01 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.6 KiB |
Quantile statistics
| Minimum | 2.74 |
|---|---|
| 5-th percentile | 2.97 |
| Q1 | 3.13 |
| median | 3.23 |
| Q3 | 3.35 |
| 95-th percentile | 3.52 |
| Maximum | 4.01 |
| Range | 1.27 |
| Interquartile range (IQR) | 0.22 |
Descriptive statistics
| Standard deviation | 0.16521968 |
|---|---|
| Coefficient of variation (CV) | 0.051024972 |
| Kurtosis | 0.36850724 |
| Mean | 3.2380161 |
| Median Absolute Deviation (MAD) | 0.11 |
| Skewness | 0.28198622 |
| Sum | 10070.23 |
| Variance | 0.027297542 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 3.26 | 94 | 3.0% |
| 3.16 | 89 | 2.9% |
| 3.22 | 88 | 2.8% |
| 3.2 | 86 | 2.8% |
| 3.36 | 83 | 2.7% |
| 3.24 | 83 | 2.7% |
| 3.18 | 79 | 2.5% |
| 3.15 | 78 | 2.5% |
| 3.23 | 77 | 2.5% |
| 3.14 | 77 | 2.5% |
| Other values (88) | 2276 | 73.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2.74 | 1 | < 0.1% |
| 2.79 | 1 | < 0.1% |
| 2.8 | 1 | < 0.1% |
| 2.82 | 1 | < 0.1% |
| 2.83 | 4 | 0.1% |
| 2.85 | 3 | 0.1% |
| 2.86 | 7 | 0.2% |
| 2.87 | 4 | 0.1% |
| 2.88 | 11 | 0.4% |
| 2.89 | 5 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 4.01 | 2 | 0.1% |
| 3.9 | 2 | 0.1% |
| 3.85 | 1 | < 0.1% |
| 3.78 | 2 | 0.1% |
| 3.76 | 1 | < 0.1% |
| 3.75 | 3 | 0.1% |
| 3.74 | 1 | < 0.1% |
| 3.72 | 3 | 0.1% |
| 3.71 | 4 | 0.1% |
| 3.7 | 1 | < 0.1% |
sulphates
Real number (ℝ)
High correlation 
| Distinct | 108 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.57107395 |
| Minimum | 0.23 |
| Maximum | 2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.6 KiB |
Quantile statistics
| Minimum | 0.23 |
|---|---|
| 5-th percentile | 0.36 |
| Q1 | 0.46 |
| median | 0.55 |
| Q3 | 0.64 |
| 95-th percentile | 0.85 |
| Maximum | 2 |
| Range | 1.77 |
| Interquartile range (IQR) | 0.18 |
Descriptive statistics
| Standard deviation | 0.16587227 |
|---|---|
| Coefficient of variation (CV) | 0.29045672 |
| Kurtosis | 9.3880618 |
| Mean | 0.57107395 |
| Median Absolute Deviation (MAD) | 0.09 |
| Skewness | 1.9345483 |
| Sum | 1776.04 |
| Variance | 0.02751361 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.54 | 125 | 4.0% |
| 0.5 | 120 | 3.9% |
| 0.56 | 111 | 3.6% |
| 0.52 | 101 | 3.2% |
| 0.58 | 101 | 3.2% |
| 0.6 | 100 | 3.2% |
| 0.53 | 98 | 3.2% |
| 0.48 | 92 | 3.0% |
| 0.57 | 86 | 2.8% |
| 0.55 | 84 | 2.7% |
| Other values (98) | 2092 | 67.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.23 | 1 | < 0.1% |
| 0.25 | 1 | < 0.1% |
| 0.26 | 3 | 0.1% |
| 0.27 | 6 | 0.2% |
| 0.28 | 2 | 0.1% |
| 0.29 | 5 | 0.2% |
| 0.3 | 9 | 0.3% |
| 0.31 | 15 | 0.5% |
| 0.32 | 11 | 0.4% |
| 0.33 | 17 | 0.5% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1 | < 0.1% |
| 1.98 | 1 | < 0.1% |
| 1.95 | 2 | 0.1% |
| 1.62 | 1 | < 0.1% |
| 1.61 | 1 | < 0.1% |
| 1.59 | 1 | < 0.1% |
| 1.56 | 1 | < 0.1% |
| 1.36 | 3 | 0.1% |
| 1.34 | 1 | < 0.1% |
| 1.33 | 1 | < 0.1% |
alcohol
Real number (ℝ)
High correlation 
| Distinct | 85 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.645598 |
| Minimum | 8.4 |
| Maximum | 14.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.6 KiB |
Quantile statistics
| Minimum | 8.4 |
|---|---|
| 5-th percentile | 9.1 |
| Q1 | 9.6 |
| median | 10.5 |
| Q3 | 11.4 |
| 95-th percentile | 12.8 |
| Maximum | 14.9 |
| Range | 6.5 |
| Interquartile range (IQR) | 1.8 |
Descriptive statistics
| Standard deviation | 1.2112905 |
|---|---|
| Coefficient of variation (CV) | 0.11378323 |
| Kurtosis | -0.58775189 |
| Mean | 10.645598 |
| Median Absolute Deviation (MAD) | 0.9 |
| Skewness | 0.53122813 |
| Sum | 33107.81 |
| Variance | 1.4672248 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 9.5 | 195 | 6.3% |
| 9.4 | 168 | 5.4% |
| 9.2 | 123 | 4.0% |
| 11 | 117 | 3.8% |
| 9.8 | 113 | 3.6% |
| 10.5 | 107 | 3.4% |
| 10 | 93 | 3.0% |
| 11.2 | 88 | 2.8% |
| 9.6 | 87 | 2.8% |
| 10.4 | 86 | 2.8% |
| Other values (75) | 1933 | 62.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 8.4 | 4 | 0.1% |
| 8.5 | 2 | 0.1% |
| 8.6 | 2 | 0.1% |
| 8.7 | 17 | 0.5% |
| 8.8 | 31 | 1.0% |
| 8.9 | 16 | 0.5% |
| 9 | 60 | 1.9% |
| 9.05 | 1 | < 0.1% |
| 9.1 | 71 | 2.3% |
| 9.2 | 123 | 4.0% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 14.9 | 1 | < 0.1% |
| 14.2 | 1 | < 0.1% |
| 14.05 | 1 | < 0.1% |
| 14 | 9 | 0.3% |
| 13.9 | 2 | 0.1% |
| 13.8 | 2 | 0.1% |
| 13.7 | 3 | 0.1% |
| 13.6 | 13 | 0.4% |
| 13.55 | 1 | < 0.1% |
| 13.5 | 6 | 0.2% |
quality
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.7874598 |
| Minimum | 3 |
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.6 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 5 |
| median | 6 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 8 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.83168806 |
|---|---|
| Coefficient of variation (CV) | 0.1437052 |
| Kurtosis | 0.22040941 |
| Mean | 5.7874598 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.16564223 |
| Sum | 17999 |
| Variance | 0.69170503 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 1397 | 44.9% |
| 5 | 1060 | 34.1% |
| 7 | 482 | 15.5% |
| 4 | 90 | 2.9% |
| 8 | 68 | 2.2% |
| 3 | 13 | 0.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 13 | 0.4% |
| 4 | 90 | 2.9% |
| 5 | 1060 | 34.1% |
| 6 | 1397 | 44.9% |
| 7 | 482 | 15.5% |
| 8 | 68 | 2.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 8 | 68 | 2.2% |
| 7 | 482 | 15.5% |
| 6 | 1397 | 44.9% |
| 5 | 1060 | 34.1% |
| 4 | 90 | 2.9% |
| 3 | 13 | 0.4% |
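Because `quality` takes only six distinct values, its sum, mean, variance, and standard deviation can be recomputed from the value counts above. The sketch below assumes the report uses the sample variance (divisor n - 1); that assumption is not stated in the report, but it reproduces the printed figure.

```python
import math

# Value counts copied from the quality tables above.
quality_counts = {3: 13, 4: 90, 5: 1060, 6: 1397, 7: 482, 8: 68}

n = sum(quality_counts.values())
total = sum(value * count for value, count in quality_counts.items())
mean = total / n

# Assumption: sample variance with divisor n - 1.
squared_dev = sum(count * (value - mean) ** 2
                  for value, count in quality_counts.items())
variance = squared_dev / (n - 1)
std = math.sqrt(variance)

print(n, total)
print(f"mean={mean:.7f} variance={variance:.8f} std={std:.8f}")
```

Dividing by n instead of n - 1 would give a slightly smaller variance that no longer matches the table at the eighth decimal place.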
Interactions
Correlations
| | alcohol | chlorides | citric acid | density | fixed acidity | free sulfur dioxide | pH | quality | residual sugar | sulphates | total sulfur dioxide | type | volatile acidity |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| alcohol | 1.000 | -0.407 | 0.098 | -0.667 | -0.206 | -0.026 | 0.107 | 0.481 | -0.156 | -0.015 | -0.102 | 0.262 | -0.145 |
| chlorides | -0.407 | 1.000 | -0.076 | 0.650 | 0.566 | -0.447 | 0.272 | -0.310 | -0.187 | 0.448 | -0.518 | 0.738 | 0.574 |
| citric acid | 0.098 | -0.076 | 1.000 | 0.048 | 0.257 | 0.091 | -0.345 | 0.181 | 0.067 | 0.126 | 0.135 | 0.483 | -0.422 |
| density | -0.667 | 0.650 | 0.048 | 1.000 | 0.587 | -0.250 | 0.096 | -0.306 | 0.264 | 0.383 | -0.265 | 0.613 | 0.374 |
| fixed acidity | -0.206 | 0.566 | 0.257 | 0.587 | 1.000 | -0.421 | -0.117 | -0.116 | -0.134 | 0.394 | -0.467 | 0.587 | 0.328 |
| free sulfur dioxide | -0.026 | -0.447 | 0.091 | -0.250 | -0.421 | 1.000 | -0.279 | 0.111 | 0.332 | -0.321 | 0.798 | 0.496 | -0.452 |
| pH | 0.107 | 0.272 | -0.345 | 0.096 | -0.117 | -0.279 | 1.000 | -0.075 | -0.283 | 0.289 | -0.399 | 0.480 | 0.424 |
| quality | 0.481 | -0.310 | 0.181 | -0.306 | -0.116 | 0.111 | -0.075 | 1.000 | 0.068 | 0.045 | 0.013 | 0.192 | -0.341 |
| residual sugar | -0.156 | -0.187 | 0.067 | 0.264 | -0.134 | 0.332 | -0.283 | 0.068 | 1.000 | -0.195 | 0.426 | 0.553 | -0.190 |
| sulphates | -0.015 | 0.448 | 0.126 | 0.383 | 0.394 | -0.321 | 0.289 | 0.045 | -0.195 | 1.000 | -0.422 | 0.517 | 0.299 |
| total sulfur dioxide | -0.102 | -0.518 | 0.135 | -0.265 | -0.467 | 0.798 | -0.399 | 0.013 | 0.426 | -0.422 | 1.000 | 0.796 | -0.498 |
| type | 0.262 | 0.738 | 0.483 | 0.613 | 0.587 | 0.496 | 0.480 | 0.192 | 0.553 | 0.517 | 0.796 | 1.000 | 0.684 |
| volatile acidity | -0.145 | 0.574 | -0.422 | 0.374 | 0.328 | -0.452 | 0.424 | -0.341 | -0.190 | 0.299 | -0.498 | 0.684 | 1.000 |
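The "High correlation" alerts can be related back to this matrix. The sketch below flags every off-diagonal coefficient whose absolute value is at least 0.5. That cutoff is an assumption: the report does not state its threshold, but 0.5 yields the same variable pairings as the alerts listed in the overview.

```python
# Correlation rows copied from the matrix above (same column order as NAMES).
NAMES = ["alcohol", "chlorides", "citric acid", "density", "fixed acidity",
         "free sulfur dioxide", "pH", "quality", "residual sugar",
         "sulphates", "total sulfur dioxide", "type", "volatile acidity"]

ROWS = [
    [1.000, -0.407, 0.098, -0.667, -0.206, -0.026, 0.107, 0.481, -0.156, -0.015, -0.102, 0.262, -0.145],
    [-0.407, 1.000, -0.076, 0.650, 0.566, -0.447, 0.272, -0.310, -0.187, 0.448, -0.518, 0.738, 0.574],
    [0.098, -0.076, 1.000, 0.048, 0.257, 0.091, -0.345, 0.181, 0.067, 0.126, 0.135, 0.483, -0.422],
    [-0.667, 0.650, 0.048, 1.000, 0.587, -0.250, 0.096, -0.306, 0.264, 0.383, -0.265, 0.613, 0.374],
    [-0.206, 0.566, 0.257, 0.587, 1.000, -0.421, -0.117, -0.116, -0.134, 0.394, -0.467, 0.587, 0.328],
    [-0.026, -0.447, 0.091, -0.250, -0.421, 1.000, -0.279, 0.111, 0.332, -0.321, 0.798, 0.496, -0.452],
    [0.107, 0.272, -0.345, 0.096, -0.117, -0.279, 1.000, -0.075, -0.283, 0.289, -0.399, 0.480, 0.424],
    [0.481, -0.310, 0.181, -0.306, -0.116, 0.111, -0.075, 1.000, 0.068, 0.045, 0.013, 0.192, -0.341],
    [-0.156, -0.187, 0.067, 0.264, -0.134, 0.332, -0.283, 0.068, 1.000, -0.195, 0.426, 0.553, -0.190],
    [-0.015, 0.448, 0.126, 0.383, 0.394, -0.321, 0.289, 0.045, -0.195, 1.000, -0.422, 0.517, 0.299],
    [-0.102, -0.518, 0.135, -0.265, -0.467, 0.798, -0.399, 0.013, 0.426, -0.422, 1.000, 0.796, -0.498],
    [0.262, 0.738, 0.483, 0.613, 0.587, 0.496, 0.480, 0.192, 0.553, 0.517, 0.796, 1.000, 0.684],
    [-0.145, 0.574, -0.422, 0.374, 0.328, -0.452, 0.424, -0.341, -0.190, 0.299, -0.498, 0.684, 1.000],
]

THRESHOLD = 0.5  # Assumed cutoff; not stated in the report.

# For each variable, list the other variables at or above the cutoff.
high = {
    name: [other for other, r in zip(NAMES, row)
           if other != name and abs(r) >= THRESHOLD]
    for name, row in zip(NAMES, ROWS)
}

for name, others in high.items():
    if others:
        print(f"{name}: {', '.join(others)}")
```

With this cutoff, `pH`, `quality`, and `citric acid` have no partners, which is consistent with the absence of a "High correlation" alert for them.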
Missing values
Sample
First rows
| | type | fixed acidity | volatile acidity | citric acid | residual sugar | chlorides | free sulfur dioxide | total sulfur dioxide | density | pH | sulphates | alcohol | quality |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Moscatel | 8.1 | 0.24 | 0.32 | 10.5 | 0.030 | 34.0 | 105.0 | 0.99407 | 3.11 | 0.42 | 11.8 | 6 |
| 1 | Moscatel | 5.8 | 0.23 | 0.20 | 2.0 | 0.043 | 39.0 | 154.0 | 0.99226 | 3.21 | 0.39 | 10.2 | 6 |
| 2 | Moscatel | 7.5 | 0.33 | 0.36 | 2.6 | 0.051 | 26.0 | 126.0 | 0.99097 | 3.32 | 0.53 | 12.7 | 6 |
| 3 | Moscatel | 6.6 | 0.38 | 0.36 | 9.2 | 0.061 | 42.0 | 214.0 | 0.99760 | 3.31 | 0.56 | 9.4 | 5 |
| 4 | Moscatel | 6.4 | 0.15 | 0.29 | 1.8 | 0.044 | 21.0 | 115.0 | 0.99166 | 3.10 | 0.38 | 10.2 | 5 |
| 5 | Moscatel | 6.5 | 0.32 | 0.34 | 5.7 | 0.044 | 27.0 | 91.0 | 0.99184 | 3.28 | 0.60 | 12.0 | 7 |
| 6 | Moscatel | 7.5 | 0.22 | 0.32 | 2.4 | 0.045 | 29.0 | 100.0 | 0.99135 | 3.08 | 0.60 | 11.3 | 7 |
| 7 | Moscatel | 6.4 | 0.23 | 0.32 | 1.9 | 0.038 | 40.0 | 118.0 | 0.99074 | 3.32 | 0.53 | 11.8 | 7 |
| 8 | Moscatel | 6.1 | 0.22 | 0.31 | 1.4 | 0.039 | 40.0 | 129.0 | 0.99193 | 3.45 | 0.59 | 10.9 | 5 |
| 9 | Moscatel | 6.5 | 0.48 | 0.02 | 0.9 | 0.043 | 32.0 | 99.0 | 0.99226 | 3.14 | 0.47 | 9.8 | 4 |
Last rows
| | type | fixed acidity | volatile acidity | citric acid | residual sugar | chlorides | free sulfur dioxide | total sulfur dioxide | density | pH | sulphates | alcohol | quality |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3221 | Syrah | 6.6 | 0.725 | 0.20 | 7.8 | 0.073 | 29.0 | 79.0 | 0.99770 | 3.29 | 0.54 | 9.2 | 5 |
| 3222 | Syrah | 6.3 | 0.550 | 0.15 | 1.8 | 0.077 | 26.0 | 35.0 | 0.99314 | 3.32 | 0.82 | 11.6 | 6 |
| 3223 | Syrah | 5.4 | 0.740 | 0.09 | 1.7 | 0.089 | 16.0 | 26.0 | 0.99402 | 3.67 | 0.56 | 11.6 | 6 |
| 3224 | Syrah | 6.3 | 0.510 | 0.13 | 2.3 | 0.076 | 29.0 | 40.0 | 0.99574 | 3.42 | 0.75 | 11.0 | 6 |
| 3225 | Syrah | 6.8 | 0.620 | 0.08 | 1.9 | 0.068 | 28.0 | 38.0 | 0.99651 | 3.42 | 0.82 | 9.5 | 6 |
| 3226 | Syrah | 6.2 | 0.600 | 0.08 | 2.0 | 0.090 | 32.0 | 44.0 | 0.99490 | 3.45 | 0.58 | 10.5 | 5 |
| 3227 | Syrah | 5.9 | 0.550 | 0.10 | 2.2 | 0.062 | 39.0 | 51.0 | 0.99512 | 3.52 | 0.76 | 11.2 | 6 |
| 3228 | Syrah | 6.3 | 0.510 | 0.13 | 2.3 | 0.076 | 29.0 | 40.0 | 0.99574 | 3.42 | 0.75 | 11.0 | 6 |
| 3229 | Syrah | 5.9 | 0.645 | 0.12 | 2.0 | 0.075 | 32.0 | 44.0 | 0.99547 | 3.57 | 0.71 | 10.2 | 5 |
| 3230 | Syrah | 6.0 | 0.310 | 0.47 | 3.6 | 0.067 | 18.0 | 42.0 | 0.99549 | 3.39 | 0.66 | 11.0 | 6 |
Duplicate rows
Most frequently occurring
| | type | fixed acidity | volatile acidity | citric acid | residual sugar | chlorides | free sulfur dioxide | total sulfur dioxide | density | pH | sulphates | alcohol | quality | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 191 | Moscatel | 7.0 | 0.15 | 0.28 | 14.70 | 0.051 | 29.0 | 149.0 | 0.99792 | 2.96 | 0.39 | 9.0 | 7 | 8 |
| 225 | Moscatel | 7.3 | 0.19 | 0.27 | 13.90 | 0.057 | 45.0 | 155.0 | 0.99807 | 2.94 | 0.41 | 8.8 | 8 | 8 |
| 231 | Moscatel | 7.4 | 0.16 | 0.30 | 13.70 | 0.056 | 33.0 | 168.0 | 0.99825 | 2.90 | 0.44 | 8.7 | 7 | 7 |
| 230 | Moscatel | 7.4 | 0.16 | 0.27 | 15.50 | 0.050 | 25.0 | 135.0 | 0.99840 | 2.90 | 0.43 | 8.7 | 7 | 6 |
| 12 | Moscatel | 5.7 | 0.22 | 0.20 | 16.00 | 0.044 | 41.0 | 113.0 | 0.99862 | 3.22 | 0.46 | 8.9 | 6 | 5 |
| 123 | Moscatel | 6.6 | 0.22 | 0.23 | 17.30 | 0.047 | 37.0 | 118.0 | 0.99906 | 3.08 | 0.46 | 8.8 | 6 | 5 |
| 140 | Moscatel | 6.7 | 0.16 | 0.32 | 12.50 | 0.035 | 18.0 | 156.0 | 0.99666 | 2.88 | 0.36 | 9.0 | 6 | 5 |
| 237 | Moscatel | 7.5 | 0.24 | 0.31 | 13.10 | 0.050 | 26.0 | 180.0 | 0.99884 | 3.05 | 0.53 | 9.1 | 6 | 5 |
| 13 | Moscatel | 5.7 | 0.22 | 0.22 | 16.65 | 0.044 | 39.0 | 110.0 | 0.99855 | 3.24 | 0.48 | 9.0 | 6 | 4 |
| 27 | Moscatel | 6.0 | 0.20 | 0.26 | 6.80 | 0.049 | 22.0 | 93.0 | 0.99280 | 3.15 | 0.42 | 11.0 | 6 | 4 |
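Duplicate detection amounts to grouping rows that are identical in every column. The sketch below illustrates the idea on three rows copied from the last-rows sample (index labels 3224, 3225, and 3228), two of which are identical. It is an illustration only; how the report itself derives the "# duplicates" column and the overall count of 464 is not verified here.

```python
from collections import Counter

# Rows copied from the last-rows sample; each tuple holds all 13 columns.
rows = [
    ("Syrah", 6.3, 0.510, 0.13, 2.3, 0.076, 29.0, 40.0, 0.99574, 3.42, 0.75, 11.0, 6),  # 3224
    ("Syrah", 6.8, 0.620, 0.08, 1.9, 0.068, 28.0, 38.0, 0.99651, 3.42, 0.82, 9.5, 6),   # 3225
    ("Syrah", 6.3, 0.510, 0.13, 2.3, 0.076, 29.0, 40.0, 0.99574, 3.42, 0.75, 11.0, 6),  # 3228
]

# Count how often each distinct row occurs, then keep those seen more than once.
occurrences = Counter(rows)
duplicated = {row: n for row, n in occurrences.items() if n > 1}

for row, n in duplicated.items():
    print(n, row)
```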